Problems of Scale in Building, Maintaining and Using Very Large Formal Ontologies

Author

  • Douglas B. Lenat
Abstract

Though Cyc is a formal ontology, the process of building it, over the past 22 years, has been a passionately empirical process. We have had several surprises along the way, some of them scientific, some engineering, and some sociological. For instance, the requirement to represent arbitrary pieces of commonsense knowledge led us, in the mid-1980s, against our intuitions, to move to an increasingly expressive formal representation language. By 1990, we had to admit that the dream of a “Final Encyclopedia” of correct knowledge was a chimera, and what we needed to focus on was a tapestry of locally-consistent “micro-theories” containing contextualized knowledge. Since then, we have begun to work out the fine structure of these micro-theories, their important attributes and the ways in which they relate to each other, and to appreciate the surprising complexity of the calculi required to formally reason across them. We have also experienced a tipping-point, methodologically, over the past few years, as the ontology has grown large enough to serve as an inductive bias for further knowledge acquisition. That is, Cyc increasingly actively helps with its own continuing expansion, and by now almost all the activity going on at Cycorp is related to semi-automatic learning from corpora (including the Web) of text and structured sources, whereas as recently as three years ago the majority of the activity here was a cadre of ontological engineers manually writing more axioms to expand the Cyc Knowledge Base. We’ve also developed and used, and in most cases discarded, a series of interfaces, training paradigms, and so on, as the ontology has grown. In the talk, I shall survey what we used, and when, and why we moved on. Most of the reasons have to do with the ontology outgrowing the tools, or increasing variety among the types of users and ontological engineers.
Finally, I will discuss some of our ongoing research efforts, and ongoing interface efforts, which are becoming increasingly intermingled, and why that is perhaps inevitable.

Similar resources

Centralized Clustering Method To Increase Accuracy In Ontology Matching Systems

Ontology is the main infrastructure of the Semantic Web, providing facilities for integration, searching and sharing of information on the web. The development of ontologies as the basis of the Semantic Web, together with their heterogeneity, has led to the need for ontology matching. With the emergence of large-scale ontologies in real domains, ontology matching systems have faced problems like memory con...

A New Play-off Approach in League Championship Algorithm for Solving Large-Scale Support Vector Machine Problems

There are numerous methods for solving large-scale problems, some of which are very flexible and efficient in both linear and non-linear cases. The league championship algorithm is one such algorithm that may be applied to these problems. In the current paper, a new play-off approach is adapted to the league championship algorithm for solving large-scale problems. The proposed algori...

Building Tailored Ontologies from Very Large Knowledge Resources

Nowadays very large domain knowledge resources are being developed in domains like Biomedicine. Users and applications can benefit enormously from these repositories in very different tasks, such as visualization, vocabulary homogenizing and classification. However, due to their large size and lack of formal semantics, they cannot be properly managed and exploited. Instead, it is necessary to p...

Employing Nonlinear Response History Analysis of ASCE 7-16 on a Benchmark Tall Building

ASCE 7-16 has provided a comprehensive platform for the performance-based design of tall buildings. The core of the procedure is based on nonlinear response history analysis of the structure subjected to recorded or simulated ground motions. This study investigates consistency in the ASCE 7-16 requirements regarding the use of different types of ground motions. For this purpose performance of a...

4. Discovering Homecare Services

Future homecare networks will consist of a very wide range of embedded services and software that will often rely on numerous other components to achieve their tasks. They will rarely operate in a self-sufficient manner. The ability to discover and use services is not, however, a trivial task. Services may provide raw data, such as temperature readings, or higher contextual data, such as user act...

Ad-Hoc and Personal Ontologies: A Prototyping Approach to Ontology Engineering

Large scale or common ontologies tend to be developed using structured and formal techniques that can be equated to the Waterfall system development life cycle. However, in domains that are not stable or well-understood a prototyping approach may be useful to allow exploration and communication of ideas. Alternatively, the ontology may be part of an intermediate step or representation that provi...

Publication date: 2006